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Designing appropriate expression vec- 
tors is one of the critical steps in the 
generation of stable cell lines for recom- 
binant protein production. Conventional 
expression vectors are severely affected 
by the chromatin environment sur- 
rounding their integration site into the 
host genome, resulting in low expression 
levels and transgene silencing. In the 
past, a new generation of expression vec- 
tors and different strategies was devel- 
oped to overcome the chromatin effects. 
Bacterial artificial chromosomes (BACs) 
are cloning vectors capable of accommo- 
dating up to 350 Kb. Thus, BACs can 
carry a whole eukaryotic locus with all 
the elements controlling the expression 
of a gene; therefore, BACs harbor their 
own chromatin environment. Expression 
vectors based on BACs containing open 
or permissive chromatin loci are not 
affected by the chromatin surround- 
ing their integration site in the host 
cell genome. Consequently, BAC-based 
expression vectors containing the appro- 
priate loci confer predictable and high 
levels of expression over time. These 
properties make BAC-based expression 
vectors a very attractive tool applied to 
the recombinant protein production 
field. 

Commentary 

For some time, recombinant protein pro- 
duction in the mammalian system has 
been a central topic in biotechnology. 1 
Indeed, more than a hundred therapeu- 
tic proteins are produced nowadays in 
mammalian systems, and it is expected 



that this number will increase rapidly 
due to the development of new therapeu- 
tic antibodies. 2 Therefore, an enormous 
effort to improve the generation of mam- 
malian cell lines expressing recombinant 
proteins has been carried on during the 
past decades. As with many biological 
processes, the generation of producer cell 
lines is hampered by several bottle-necks 
(transcription, translation, protein fold- 
ing, protein secretion, etc.). Indeed, one 
of the most important issues is the choice 
of the expression vector to generate the 
producer cell line. 1 In principle, an ideal 
expression vector should fulfill the fol- 
lowing criteria: (1) expression should be 
independent of the integration site into 
the host genome, (2) expression should 
not be silenced over time, (3) expression 
should be directly proportional to the 
number of integrated copies into the host 
genome and (4) expression levels should 
be high. The most conservative method 
to generate stable cell lines comprises the 
transfection of a plasmid containing a 
promoter which drives the expression of a 
gene of interest and an antibiotic marker 
for selection (Fig. 1A). Generation of sta- 
ble cell lines using these kinds of vectors 
results in unpredictable and generally low 
expression levels. Furthermore, expression 
from such vectors tends to be silenced over 
time. These disadvantages are due to the 
so called "positional chromatin effects." 3 
Transgene expression using such plasmid- 
based vectors is highly affected by the 
surrounding chromatin of their integra- 
tion site into the host genome. Thus, if 
the vector integrates in "silent chroma- 
tin" regions, its expression will be low or 
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Figure 1 . Strategies to generate stable cell lines. (A) Random integration of a plasmid-based vector: a conventional expression vector containing a 
promoter, a gene of interest (GOI), a polyadenylation signal (pA) and a selection marker (e.g., neomycin) transfected into cells integrates randomly into 
the host cell genome. These vectors are highly affected by the surrounding chromatin of their integration site, often resulting in low or no expres- 
sion and silencing of the transgene overtime. (B) Random integration of a plasmid-based expression vector flanked by "chromatin modifiers" (CM). 
The chromatin modifiers shield the expression vectors from the effects of the chromatin surrounding their integration site into the cell host genome. 
This results in better expression and stability of the transgene compared with (A). (C) Targeted integration of an expression vector into a chromatin 
permissive region (hot spot). By means of recombinase-mediated cassette exchange or somatic homologous recombination, an expression vector is 
targeted (integrated) to a hot spot known to be a permissive chromatin region. This results in predictable and stable expression of the transgene, but 
it is a more laborious method than described in (A) or (B). (D) Random integration of a BAC containing an expression vector. An expression vector is 
inserted into a 200 kb BAC containing an open chromatin locus (e.g., Rosa26 locus), transfected into the cells and randomly integrated into the host 
cell genome. The BAC-based expression vector carries on its own chromatin environment and it is not affected by the surrounding chromatin of its 
integration site. Several copies of the BAC-based vector can be co-integrated, thus resulting in high and stable expression levels of the transgene. 



suppressed at some time. This translates 
in the necessity of analyzing large num- 
bers of cell clones to find an optimal pro- 
ducer. Furthermore, several rounds of 
genomic amplification of the transgene 
using selection markers such us dihy- 
drofolate reductase (Dhfr) or glutamine 
synthetase (Gs) are needed to reach high 
levels of expression. 4 This makes the gen- 
eration of stable cell clones a tedious and 
time-consuming process. During the past 
years, researchers have developed several 
strategies to avoid the positional chro- 
matin effects and improve the efficacy of 
the expression vectors. One of the most 
popular approaches consists of flanking 
the expression vectors with cis-elements 
that modify the chromatin environment: 
"chromatin modifiers" (CM) (Fig. IB). 
The rationale behind this approach is 
that expression vectors containing CM 
are less affected by the detrimental effects 



of the chromatin surrounding the inte- 
gration site into the host genome. Thus, 
by placing CM in the expression vectors, 
less numbers of clones need to be ana- 
lyzed. In this regard, several CM have 
been described and successfully used for 
the establishment of recombinant cell 
lines, e.g., insulators, 5 matrix attachment 
regions, 6 chromatin opening elements 7 or 
antirepressor elements. 8 A second strategy 
consists of the identification of a genomic 
region on the host cell genome that allows 
reproducible and high expression levels of 
the transgene, the so-called "hot spots" 
or "chromatin permissive regions." Once 
an optimal hot spot has been identified, 
the expression vector can be knocked-in 
into such a pre-defined genomic region 
(Fig. 1C). This is usually achieved 
using "site-specific recombinase medi- 
ated cassette exchange" 5,10 or by stimu- 
lating homologous recombination into 



pre-predefined genomic regions using 
zinc-finger nucleases (ZFNs) 11 or tran- 
scription activator-like effector nucle- 
ases (TALENs). 12 The advantage of this 
method is that the expression levels of 
the transgenes are predictable. However, 
since this method introduces one single 
copy of the expression vector into the host 
genome, the expression levels may be lim- 
ited and genomic amplification may be 
needed to obtain higher/sufficient expres- 
sion. A third strategy is to use expression 
vectors with large cloning capacity able 
to accommodate a whole mammalian 
genetic locus. Such vectors will carry a 
pre-defined open chromatin locus and its 
own chromatin environment. Therefore, 
these vectors will not be affected at all by 
the surrounding chromatin of their inte- 
gration site (Fig. ID). 3 The most popular 
vectors with large cloning capacity are 
yeast artificial chromosomes (YACs) and 
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bacterial artificial chromosomes (BACs). 
YACs can accommodate up to 2 Mb, thus 
making them interesting candidates for 
expression vectors. 3 However, manipu- 
lation of YACs is very labor-intensive 
and time consuming; therefore, we do 
not consider YACs qualified for routine 
expression vectors for recombinant pro- 
tein production. BACs, circular plasmids 
maintained in Escherichia coli, are derived 
from the F-factor and are able to accom- 
modate up to 350 kb. BACs are easy to 
manipulate, and yields of purified BAC 
DNA are reasonable and can be trans- 
fected into mammalian cells using con- 
ventional methods, although due to their 
large size, transfection efficiencies are 
lower compared with smaller plasmids. 13 
Indeed, BACs can be handled almost like 
"conventional plasmids"; however, due to 
their large size, modification of the BACs 
cannot be done using standard clon- 
ing procedures (digestion with restric- 
tion enzymes, ligation, etc.) but using 
homologous recombination in E. coli 
(recombineering), a very simple and well- 
established method. 14 " 16 Furthermore, 
there are large BAC genomic DNA librar- 
ies annotated to the human and mouse 
genome available that greatly alleviate 
identification and obtainment or retrieval 
of a BAC containing a locus of interest. 17 
As mentioned above, the major advan- 
tage of using BACs as expression vectors 
for recombinant protein production is 
their large cloning capacity. BACs can 
contain an entire locus, including most 
(if not all) of the elements that regulate 
gene expression (promoters, enhancers, 
silencers, insulators, etc.). 18 Thus, BAC- 
based vectors carry on its own chromatin 
environment and they are not affected 
by the surrounding chromatin upon ran- 
dom integration into the host cell genome 
(Fig. ID). Therefore, BAC-based vectors 
confer copy-number dependent and chro- 
matin position independent expression, 
making BACs very attractive candidates 
for expression vectors. Indeed, due to their 
characteristics, BAC-based expression vec- 
tors have been vastly used during the past 
15 years in the transgenic mouse field; 1 '' 
surprisingly, only a few examples have 



been published using BACs as expression 
vectors applied to recombinant protein 
production in mammalian cells. 20 " 23 

A critical issue of using BACs as expres- 
sion vectors is the carried locus. Obviously, 
in order to exploit the full benefits of 
BACs as expression vectors they must 
carry an open chromatin locus. In this 
sense, BAC-based vectors can be consid- 
ered as the genetic equivalent to a knock- 
in of an expression vector in a genomic 
hot spot (Fig. 1C). The advantage is that 
the BAC-based vectors carry the hot spot 
themselves, thus making BAC-based vec- 
tors more flexible than the knock-in strat- 
egy. Furthermore, the expression levels of 
BACs are not limited to one integrated 
copy as it is in the case of the knock-in 
strategy. We have observed cell lines with 
at least 50 BAC copies integrated, thus 
initial expression levels are already high, 
avoiding the requirement for transgene 
amplification. 

In our initial studies, we used a 200 kb 
BAC containing the Rosa26 {Thumpd3) 
locus. 24 The Rosa26\oc\is is a region known 
to be open chromatin and widely used to 
express genes in transgenic mice. 25 By 
placing an expression cassette (promoter, 
gene of interest, polyadenylation signal 
and selection marker) into the Rosa26 
BAC we were able to obtain HEK293 
stable pools with a 10 x increase in pro- 
ductivity compared with the plasmid- 
based vectors. Furthermore, we observed 
that protein expression levels were directly 
proportional to the integrated BAC-copy 
number and that protein expression was 
stable for at least 30 passages. 21 Recently, 
using two anti-HIVl single-chain frag- 
ment crystallizable (scFc) antibody deri- 
vates of human IgGs as model proteins, 
we have shown that the Rosa26 BAC- 
based vector improves recombinant pro- 
tein production in DUKX-B11 CHO cells 
and, importantly, the expression levels 
are maintained over time. 23 These results 
illustrate the robustness of the BAC vec- 
tor system and its flexibility regarding the 
host cell line. Furthermore, our results 
have been elegantly confirmed by other 
researchers. A 175 kb BAC-based expres- 
sion vector containing the dihydrofolate 



reductase (Dhfr) resulted in copy num- 
ber-dependent and stable expression in 
NIH 3T3 cells. 20 Thus, other BAC-based 
expression vectors containing other loci 
than the Rosa26 can also be successfully 
used to establish cell lines. 

In conclusion, BAC-based vectors are 
not affected by the chromatin surround- 
ing their genomic integration site, their 
expression levels positively correlate with 
the integrated BAC copy numbers and 
they confer high and stable expression. 
Thus, BAC-based vectors satisfy the con- 
ditions of an ideal expression vector for 
recombinant protein production in mam- 
malian cells. Future developments may 
include the use of isogenic BACs to the 
host cell line. Indeed, the Rosa26 locus 
used in our BAC vectors is from murine 
origin and was used to generate cell lines 
in HEK293 and CHO cells from human 
and hamster origin, respectively. Although 
we have obtained good results using a 
non-isogenic BAC, it seems more reason- 
able to use isogenic BACs to the host cells, 
e.g., BACs containing genes from CHO 
origin in CHO cells. The choice of the 
BAC-based expression vector containing 
the Rosa26 locus was the consequence 
of an educated guess. The BAC-based 
expression vector containing the Rosa26 
locus is not necessarily the best option to 
generate CHO cell lines. Therefore, a very 
attractive approach will be to screen CHO 
cells to find genomic hot spots, retrieve 
the BACs encompassing such genomic 
locations and use such BACs as expression 
vectors. The establishment of BAC librar- 
ies from CHO origin and the availability 
of the CHO genome should facilitate this 
approach in the future. 26,27 
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